The Graduate School PENALIZED QUADRATIC INFERENCE FUNCTIONS FOR VARIABLE SELECTION IN LONGITUDINAL RESEARCH
نویسندگان
چکیده
For decades, much research has been devoted to developing and comparing variable selection methods, but primarily for the classical case of independent observations. Existing variable-selection methods can be adapted to cluster-correlated observations, but some adaptation is required. For example, classical model fit statistics such as AIC and BIC are undefined if the likelihood function is unknown (Pan, 2001). Little research has been done on variable selection for generalized estimating equations (GEE, Liang and Zeger, 1986) and similar correlated data approaches. This thesis will review existing work on model selection for GEE and propose new model selection options for GEE, as well as for a more sophisticated marginal modeling approach based on quadratic inference functions (QIF, Qu, Lindsay, and Li, 2000), which has better asymptotic properties than classic GEE. The focus is on selection using continuous penalties such as LASSO (Tibshirani, 1996) or SCAD (Fan and Li, 2001) rather than the older discrete penalties such as AIC and BIC. The asymptotic normality and efficiency (in the sense of the oracle property) of SCAD are demonstrated for penalized GEE and for penalized QIF, with the SCAD and similar penalties. This is demonstrated both in a fixed-dimensional and a growingdimensional scenario.
منابع مشابه
Model Selection for Correlated Data with Diverging Number of Parameters
High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases.We propose the penalized quadratic inference function to perform model selection and estimation simultaneously in the framework of a diverging number of regression parameters. The penalize...
متن کاملPenalized Bregman Divergence Estimation via Coordinate Descent
Variable selection via penalized estimation is appealing for dimension reduction. For penalized linear regression, Efron, et al. (2004) introduced the LARS algorithm. Recently, the coordinate descent (CD) algorithm was developed by Friedman, et al. (2007) for penalized linear regression and penalized logistic regression and was shown to gain computational superiority. This paper explores...
متن کاملPenalized quadratic inference functions for single-index models with longitudinal data
In this paper, we focus on single-index models for longitudinal data. We propose a procedure to estimate the single-index component and the unknown link function based on the combination of the penalized splines and quadratic inference functions. It is shown that the proposed estimation method has good asymptotic properties. We also evaluate the finite sample performance of the proposed method ...
متن کاملEffect of Probiotics on Infantile Colic Using the Quadratic Inference Functions
Background: Infantile colic is defined as episodes of extreme and excessive crying due to unknown causes. Various results have been reported regarding the management of colic with probiotics in terms of effectiveness, with no side effects or health risks in the infants. The present study aimed to evaluate the effect of probiotics on the infants with colic using the quadratic inference functions...
متن کاملAutomatic Variable Selection for High-Dimensional Linear Models with Longitudinal Data
High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases. We consider the problem of variable selection in high-dimensional linear models with longitudinal data. A new variable selection procedure is proposed using the smooth-threshold generaliz...
متن کامل